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ABSTRACT 



For a dynamical system far from equilibrium, one has to deal with em- 
pirical probabilities defined through time-averages, and the main problem is 
then how to formulate an appropriate statistical thermodynamics. The com- 
mon answer is that the standard functional expression of Boltzmann-Gibbs 
for the entropy should be used, the empirical probabilities being substituted 
for the Gibbs measure. Other functional expressions have been suggested, 
but apparently with no clear mechanical foundation. Here it is shown how a 
natural extension of the original procedure employed by Gibbs and Khinchin 
in defining entropy, with the only proviso of using the empirical probabilities, 
leads for the entropy to a functional expression which is in general different 
from that of Boltzmann-Gibbs. In particular, the Gibbs entropy is recov- 
ered for empirical probabilities of Poisson type, while the Tsallis entropies 
are recovered for a deformation of the Poisson distribution. 
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1 Introduction 



Classical equilibrium statistical mechanics is a well established subject. One 
deals with a dynamical system defined by a Hamiltonian function H on a 
phase space ^A , which is provided with a suitable probability measure (typ- 
ically Lebesgue measure). Having fixed a value U of the mean energy, one 
then correspondingly defines a conditional probability (the Gibbs measure) 
on A^. To obtain statistical thermodynamics, the classical procedure of 
Gibbs and Khinchin (see ^) consists first of all in identifying the external 
work 6W as 5W =< df^H > d k, where < • > denotes average with respect 
to Gibbs measure, and the Hamiltonian has been assumed to depend on an 
external parameter k. A corresponding expression for the exchanged heat 
6Q is thus obtained, as 5Q = dU — 6W, and this finally allows to introduce 
entropy in the standard thermodynamic way. 

However one often deals with situations in which the relevant proba- 
bility measure is different from that of Gibbs. Typically this occurs when 
the probability is defined dynamically in terms of time-averages (sojourn 
times) and the final time is not long enough for equilibrium to have been 
attained (statistical mechanics far from equilibrium). In such a case, follow- 
ing Poincare, Boltzmann and Einstein one can assume that the expectations 
should be computed in terms of time-averages. 

The problem is then to determine which is the correct expression for 
the thermodynamic functions in such a situation, and for this it is sufficient 
to provide an expression for the entropy. Many people accept the thesis 
that entropy should be defined by the classical formula of Gibbs with the 
only proviso that the empirical probabilities (time-averages) should be sub- 
stituted for the Gibbs measure. Other people, particularly Tsallis, suggest 
instead that different formulae should be used, but a mechanical foundation 
for such formula is apparently lacking. 

In the present paper we show how the standard procedure to define en- 
tropy recalled above can be implemented also if, in the expression for the 
external work, the averaging is performed through the empirical probabili- 
ties rather than through Gibbs measure. In particular, the standard Gibbs 
entropy is obtained for empirical probabilities of Poisson type, while the 
Tsallis entropies are recovered for a deformation of the Poisson distribution. 
The implementation of the Gibbs-Khinchin procedure requires however to 
previously define a suitable mathematical setting for dealing with the em- 
pirical probabilities. This is dealt with in the first two sections. The ther- 
modynamics is then developed, and the two cases leading to the Gibbs and 
the Tsallis distributions are finally considered. Some open problems are also 
mentioned. The style of the paper will be quite informal, with no striving 
for strict mathematical rigour. 
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2 A priori probability 



Suppose one has a map (f) : A4 ^ A4 (for example the time-flow of an 
autonomous Hamiltonian system at time At) in a given phase space M, 
and we are interested in computing time-averages of a dynamical variable 
A{x) (a real function on M): 

■ 1 ^ 

^(xo) = ^2^A(x„) for iV»l, 

n=l 

the sequence {xn} being defined by the recurrence Xn+i = (pixn)- The 
number N thus plays the role of the "final" time, and since now on will be 
thought of as a fixed parameter. One can divide the space M into a large 
number K of disjoint cells Zj (such that M. = UZj), and one has then 

K 

i=i 

where Aj is the value of A in a point x € Zj, and nj is the number of 
times the sequence visits Zj. It is clear that nj depends on xq so 

that, if a certain probability distribution is assigned for the initial data 
Xq, correspondingly nj turns out to be a random variable with a certain 
distribution function F.j{n), which will depend both on the dynamics (i.e. 
on the map i^) and on the distribution of the initial data. So one can speak 
in general of the "a priori probability P that the cell Zj will be visited a 
number of times rij < n:" 

P{nj <n) = Fj{n) . (1) 

In the following, in order to simplify the discussion, it will be supposed that 
such a probability does not depend on the cell Zj (i.e. Fj{n) = Fin) Vj). 
The general case can also be easily dealt with. With an abuse of notation 
we will denote by F{nj) the probability that the cell Zj is visited a number 
of times < n^, following the common attitude of using the same letter both 
for a random variable and for its value. 

A central point in the discussion is that the time-average A{xq) is itself 
a random variable, so that it is meaningful to consider its expectation. De- 
noting by < • > expectation with respect to the a priori distribution, one 
has then 

1 ^ 

<i>=-^A,<n,> . 

A step forward is made if one supposes that the expectation of nj can 
be computed by a formalism analogous to the Grand Canonical one, i.e. if 
one introduces the 
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Hypotesis 1 The quantities nj are independent random variables, condi- 
tioned by^Uj = N. 

Using this hypothesis, the expectation of A can be computed as 

1 / • • • n,=7V E,n,^, d F(ni) • . . d F{nK) 



N !-!^^^^^dF{n,)---dF{nK) 
In particular, introducing the function 

Z{\,A) = J ■■■ J e-^^^^"^^^dF(ni)---dF(nK) , 
which generates the moments of the random variable A, one obtains 



(2) 



<A>=-^dxlogZiX,A) 



(3) 

A=0 



3 Conditional Probability and Large Deviations 

Usually in statistical thermodynamics one does not deal directly with the a 
priori probability, because it is generally assumed that the time-average of 
a certain macroscopic quantity, typically the total energy, has a given value, 
which should play the role of an independent variable. So we consider the to- 
tal energy, which we now denote by e, and its time-average e = 1/N '^ejuj, 
and we impose on the numbers ni, • • • , uk the condition 

1 ^ 

j^H^i^i = U = const . 

Actually in such a way one meets with a large deviation problem, because 
one usually also assumes that one has 

1 ^ 

U ¥^<£>=j^^£3 <nj> , 

and furthermore that U — < e > is large, against the law of large numbers. 
One is thus in an extremely unlikely situation, where the information on 
the value U of e turns out to have a great relevance, by conditioning the 
expectations of the other quantities of interest. The problem we discuss in 
the rest of the present section is indeed how to compute such a conditional 
probability, or a posteriori probability, as it is also called. 
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Using the tool of the moment function, the a posteriori expectation of 
A, denoted hy < A >u, is given by 



<A>u=-^dxlogZ{X,A,U) 



where the moment function Z is defined by 



(4) 

A=0 



Z{X,A, U) = J dAexp{-XA) J ■ ■ ■ J dF{ni) ■ ■ ■ dF{nK) 

R 

-^(^ - ^ E "^J^MU - 1 ^ ejnj)S{N - ^ n,) , (5) 

d{x) being the usual Dirac's function. Here, use was made of the familiar 
formula for the conditional probability, namely the formula P{{A)\{B)) = 
P{{A)n{B))/ P{{B)), where the event (B) consists in having fixed the time- 
average of the energy, e = U. 

To compute the moment function we use the familiar representation for 
the Dirac's function, i.e. J^j^dkexp{ikx) d{x) as L — oo. One has then 

L L 

Z{\A,U) = Jixri j dki J dk2 j dA exp(- A^) 

-L ~L R 

j ■■■ j dF{ni)---dF{nK) e-x.^(i{U - ^^ejnj)ki+i{N -^nj)h 



L L 



= lim dki d k2 e:>^p(ikiU + ik2N) 

L^+ooJ J 
-L -L 

J ... |dFM•••dF(nx)^exp(-%(^ + ^^+^fc2)) • 



Notice that in the last integral there appears the Laplace transform of the 
distribution function F{n). Now we make the following 

Hypotesis 2 Defining x(z) hy 

+00 



j exp(-zn)dF(n) = exp(x(2;)) , 







we suppose that x{^) analytic in the half plane > (this is true if the 
random variables uj are supposed to be infinitely divisible). 
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Then one has 



Z{X,A,U) = 

L L 



^hm^y dki J dk2 exp likiU + ik2N + Y^x[^ +ik2 



(6) 



An asymptotic expression for the integral © can be given through the 
steepest descent method. One has to find the values of ki and ^2 which 
satisfy the system 



(7) 



One can show that such values exist provided one makes the further assump- 
tion (to which we plan to come back in the future) 

U <<e> . 

It then turns out that such values are imaginary, so we denote them by 
iki = 6 and ik2 = a. In addition, one also has to compute the determinant 
det H{9, a) of the second derivatives, which gives 

(8) 

One has thus 

exp (eU + aN + Y. x{^ + '-^ + o)) 
Z{X,A,U)c^ ^ , ^ (9) 

Obviously, to obtain bounds on the errors, one should give an estimate 
of the quantity det -ff(0, a), but this depends on the functions e and x{z). 
Prom now on, we suppose that the denominator in Q can be neglected in 
the computations of the expectations. This will be checked to be true, in 
all the examples to be considered below. Thus, for the expectation one has 
the formula 



< A>u = -dx\ogZ{\A,U)\x=o 



N 



A=0 
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which finahy, using 0, becomes 



<^>-=-^E^.x'(f + «) . (10) 

With this result, the form of the expressions ((TJ and (|lUp shows that the 
quantity —x'isjd/N + a) plays the role of the mean number of times the 
cell Zj is visited. Indeed, in terms of the quantities 

z^/=i:'-x'M/iV + a) , (11) 

such relations can be written as = X^^jj ^ ~ Yl^j^jl^ ^^"^ < ^ >U= 
Yl-^j^jl^ ■ quantity Dj may be called the mean occupation number of 
cell Zj. 



4 The Thermodynamics 

The expression ()lUp solves the problem of computing the conditional expec- 
tation, but one can ask for the meanings of the two quantities 6 and a. Let 
us consider the problem of ^. To this end it is convenient to introduce as 
an indipendent variable, instead of Zj, the quantity Uj = —x'izj), and this 
naturally leads to introducing in place of x its Legendre transform h defined 
as usual by 

h{uj) = (i^i^i+x(^i))L^=_^,(,^) • 

Notice that, while Uj has the meaning of a mean occupation number (con- 
ditioned on U), the quantity Uj just plays the role of a parameter, in the 
same sense as zj does. In particular, the quantities uj do not need satisfy 
any condition related to normalization, or the fixing of an energy value. One 
has then 

Proposition 1 The values Vj = —x'isjO/N + a), i = 1, . . . , K , correspond 
to a maximum of the function 

K 

S{u,,...,UK) = ^h{u,) , (12) 

i=i 

constrained to the surfaces "^J^j = N and ^^jT-'j/N = U. 



def 

Proof. One simply considers the function G = S — {0 /N) ^ EjVj — a ^ Vj, 
and from dy.G = one gets 

ftp 

h'{i?,) = ^ + a. (13) 
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But now, from the Legendre duality, one has 

from which the thesis follows. One easily checks that one actually deals with 
a maximum. Q.E.D. 

As a corollary there follows that the maximum of S (divided by A^) is indeed 
the thermodynamic entropy, and 6/N the inverse temperature, or at least an 
integrating factor of the exchanged heat. In other terms, the thermodynamic 
entropy S^^{9, a) turns out to be given by 

S''^{e,a) = Siuu...,i^K)/N, 

up to an additive constant. 

In fact, suppose now that the values £j depend on some external param- 
eter, say K. Then df^ej is the reaction force needed to keep the parameter 
fixed when the system is in cell Zj of the phase space, so that the quantity 
di^Ej d K is the (instantaneous) work performed on the system when the ex- 
ternal parameter is changed by a quantity d/t. For the macroscopic work 
SW, namely the expectation of the time-averaged instantantaneous work, 
using relation (|1U|) one then obtains the expression 

j 

By definition, the exchanged heat is then 
SQ = dU-6W 



N 

On the other hand, from (^Sj) one has £j/N = {h'{i>j) — Oi)/0, so that one 
finds 



3 

'J 



3 



because in our hypotheses N is kept constant, so that '^duj = 0. Concern- 
ing the physical meaning of a, we have no clear idea at the moment. 
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5 The Gibbs distribution 



The most natural choice for the distribution function that one might consider 
is that of Poisson, namely 



k<nj 

where p is a positive parameter. This corresponds to assuming that the 
successive visits of a given cell are independent events. 
The Laplace transform then has the form 

T 

/ e'"" d F = ^ e-""i-y = exp(pe-" - p) , 
"=o 

which in particular exhibits the well known fact that the distribution is 
infinitely divisible. One has thus 

X{z) =pe"^ -p , 

and 



-X (-^ +a) =pe 



The condition ^ i^j = N then gives pe~" = N/Z{P), where Z(P) ^ e~^''^ 
is the usual canonical partition function, and /? = 6/N. Thus, for the mean 
occupation number one has the usual Gibbs formula 



With this result, the value of the denominator in Q can be computed, 
and one gets 

7V2 ^ \ N ^ N ^ J ^ \ N ^ N ^ 

m 

a quantity which tends, for A ^ 0, to the familiar expression for the canon- 
ical specific heat Cy. So, if the specific heat Cy is an extensive quantity 
of the same order of magnitude as U, as usual for the systems dealt with 
in statistical mechanics, it follows that the steepest descent method gives 
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the correct answer. In addition, in computing the expectation of A the 
denominator gives a contribution equal to 



dx log Cv 



dxCv 



Cv 



which is of order 0(1) for quantities A of macroscopic type, and so can be 
safely ignored. 

Coming finally to the computation of the entropy, one first has to com- 
pute the quantity /i(i/) = uz+pe^^ —p using u = pe~^, i.e. z = — logz/+logp, 
so that one has h^v) = —x'iz) = — z^logi^ + + ulogp — p. Thus for the 
entropy one finds the expression 



i.e., up to an additive constant, the classical expression of Boltzmann (or 
rather the analogous one of Gibbs involving the global phase-space). 

6 The infinitely divisible distributions and the 
Tsallis entropies 

The infinitely divisible distributions are characterized according to the fol- 
lowing theorem (see the classical handbook 2 of Feller) 

Theorem 1 The function exp(x(-z)) is the Laplace transform of an in- 
finitely divisible distribution if and only if x{^) = o'^^ (~l)"x^"K-2) > 0; 
Vz e R+ and Vn G N. 

Perhaps, the simplest choice consists in considering the inverse powers of z. 
Taking into account normalization conditions, one is then naturally led to 
consider the one-parameter family, parameterized by 7 > 0, which, for any 
positive constant p, is given by 



Notice that it reduces to the Poisson distribution in the limit 7 +00. 
The mean occupation number uj is then given, using (|1H) . by 



Now, it turns out that this coincides, with a suitable relabeling of the pa- 
rameters, with the Tsallis distribution of index q (see jS]), 



5 = - ^ I/, log + iV(l + logp) - Kp , 



3 




(14) 




C(l + /?,(g-l)e,-)^ 
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if one sets 

1 

7 = 7 ■ 

q-1 

Notice that, in terms of the parameter q, the one-parameter family (|14() can 
also be written in a form reminiscent of the Poisson distribution, namely 

X{z) =pe~^ -p , 

where is the q deformation of the exponential introduced by Tsallis, 
namely 

= + . 

The fact that the mean occupation number, as a function of energy, is 
a g-distribution is due to the fact that the entropy (fT2|) corresponding to 
distribution (|14p essentially coincides with that of Tsallis. In fact, from p4p 
one has 



-7-1 



7, 

so that, computing Zj as a function of i^j, one gets 

J- _j_ 

K^j) = (7 + l)Vj^^P''+^ - ^Vj - p . 

Apparently, the entropy S = tlius obtained does not have the 

form of the Tsallis entropy, even if 7 is expressed in terms of q as above. 
However the coincidence (apart from an inessential multiplicative constant) 

is obtained if the present entropy is written in terms of the quantities pj *== 

and if the mean energy is computed in terms of "escort" 
probabilities (see jlj), namely pJ, i.e. essentially our v'j. Indeed, in terms of 
the variables pj and of the parameter q the present entropy S reads 

5(pi,...,p^) = C(g)5^^^^, 

where we have introduced the constant C{q) *== pq'^''~^^ / . This expression 
differs from that of Tsallis only by a multiplicative constant, so that obviu- 
osly both entropies produce by maximization the same distribution. 

In a such a way the expressions of the Tsallis entropies are recovered. 
The main difference seems to be that the present procedure, in virtue of 
its statistical-mechanics foundation, does not require any jumping between 
probabilities and escort probabilities. 
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7 Conclusions 



We believe we have given a consistent procedure to define a thermodynamic 
entropy for off-equihbrium situations. A characteristic feature of the present 
approach is that it fohows the same scheme of Gibbs and Khinchin, which 
consists in finding an integrating factor for the exchanged heat, with no 
mention of information theory at all. We are well aware however of the fact 
that another problem remains open, because in general there exist infinitely 
many integrating factors, so that a further requirement is needed in order to 
uniquely determine the thermodynamic entropy. This fact was particularly 
emphasized for example by Ehrenfest and Caratheodory (see [Hj). Notice 
that such a requirement involves the question of the extensivity property 
of entropy. These interesting problems are left for possible future studies. 
Another subject for further study is the dependence of the entropy on the 
number N of iterations, i.e. on time. 

The second peculiarity of the present approach concerns the essential 
ingredient through which probability was introduced, namely the probability 
distribution F{nj) for the number of visits of cell Zj^ against the common 
use of interpreting the numbers Uj/N themselves somehow as probabilities. 
It has been shown how thermodynamics turns out to be defined in terms of 
the function x{z) uniquely associated to the distribution F{nj). Notice that 
the treatment developed here can be easily extended to the nonhomogeneous 
case, in which the probability distribution F depends on cell Zj. In such a 
case the function x(-z) too depends on j as does the function h. One has 
then for example Vj = —x'j{£jO/N + a) and S(z/i, . . . , vk) = ^ji^j)^ 
everything remains essentially unchanged. 
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